Two new estimation methods for a superpositional intonation model
نویسندگان
چکیده
This work presents two new approaches for parameter estimation of the superpositional intonation model for German. These approaches introduce linguistic and paralinguistic assumptions allowing the initialization of a previous standard method. Additionally, all restrictions on the configuration of accents were eliminated. The proposed linguistic hypotheses can be based on either tonal or lexical accent, which gives rise to two different estimation methods. These two kind of hypotheses were validated by comparison of the estimation performance relative to two standard methods, one manual and one automatic. The results show that the proposed methods far exceed the performance of the automatic method and are slightly beyond the manual method of reference.
منابع مشابه
Intonation modeling of Mandarin Chinese using a superpositional approach
The intonation model is an important component in text-tospeech systems to obtain natural and expressive speech synthesis. In this paper we propose a superpositional model for Mandarin Chinese. The intonation model is composed of the syllable and the phrase component. The parameters of the model are estimated using JEMA, a training approach with many advantages related to robustness and precisi...
متن کاملThe Copasul Intonation Model
A new data-driven and linguistically interpretable intonation model for the automatic analysis and synthesis of fundamental frequency contours is introduced: the CoPaSul model, which provides a contour-based (Co), parametric (Pa), and superpositional (Sul) intonation representation. Its application in F0 analysis and generation is described as well as its linguistic anchoring with respect to se...
متن کاملEstimating speaker-specific intonation patterns using the linear alignment model
Modeling speaker-specific intonation is important in several areas, including speaker identification, verification, and imitation using text-to-speech synthesis. However the choice of the intonation model and the estimation of its parameters from spontaneous speech remains a challenge. We propose a way to estimate speaker-specific intonation parameters for a particular superpositional model, th...
متن کاملEstimating phrase curves in the general superpositional intonation model
Superpositional intonation models posit that the pitch contour, , can be quasi-additively decomposed into component curves such as phrase curves, accent curves, and segmental perturbation curves. Currently, these component curves can only be estimated if one assumes a specific superpositional model, such as the Fujisaki model. A method is proposed for estimating phrase curves that is model-inde...
متن کاملFoot-based Intonation for Text-to-Speech Synthesis using Neural Networks
We propose a method (“FONN”) for F0 contour generation for text-to-speech synthesis. Training speech is automatically segmented into left-headed feet, annotated with syllable start/end times, foot position in the sentence, and the number of syllables in the foot. During training, we fit a superpositional intonation model comprising accent curves associated with feet and phrase curves. We propos...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010